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PROMOTION OF HOMOLOGOUS DNA PAIRING BY RecA-OERIVEO PEPTIDES 

Field of the invention 

The present invention relates to peptides capable of promoting homologous DNA pairing and the use of the 
compositions containing the peptides in promoting such DNA pairing. More specifically, the invention relates to short 
5 peptides derived from the £ cofi recA protein which are capable of catalyzing homologous DNA pairing. 

Background of the Invention 
RecA proteins play an essential role in homologous DNA recombination pathways (Cox et aL, Bioessays, 15:617- 
6;!3, 1993; Kowalczykowski et a!., Ann. Rev. Bfachem., 63:991-1044, 1994]. The first RecA protein was purified from 
£ coli in 1965 by Clark et al. \Proc. Natl. Aced. Sci. USA, 53:451, 1965} and since then has been isolated and cloned 
10 from many other prokaryotic organisms (Roca et al„ BkichBm* Mol Bio!., 25:415455, 1990). The RecA protein is a 
3D kD polypeptide which possesses a multitude of biochemical activities. 

RecA promotes homologous DNA recombination via a multi-step pathway involving the formation of a single- 
stranded nucleoprotein filament, DNA pairing and subsequent strand exchange to form heteroduplexDNA. RecA is directly 
involved in post-replication DNA repair and the induction of the SOS response by cleavage of the lexA repressor protein. 
15 RecA mediates a set of DNA strand exchange reactions m vitro by self-assembly into filaments, binding to both single- 
stranded DNA (ssDNA) and double-stranded DNA (dsDNA), and hydrolysis of ATP. Although the crystal structure of the 
protein has been solved (Story et al.. Nature, 355:318-325, 19921, the precise domain(s) which catalyze the pairing of 
tvo homologous DNA molecules are unknown. RecA binds to ssDNA, forming a nucleoprotein (presynaptic) filament that 
is >:he homology searching moiety that mediates the pairing with a target DNA duplex. 
20 Summary of the Invention 

Gne embodinent of the present invention is a peptide containing the amino acid sequence shown in SEQ ID N0:1 
or conservative amino acid substitutions thereof, wherein the amino acids at positions 1, 2, 4, IS and 20 of SEQ ID NO: 
1 ;:re asparagine, glutamine, arginine, glycine and glycine, respectively, the peptide being capable of promoting pairing 
of a single-stranded DNA molecule and a double-stranded DNA molecule, wherein the single-stranded DNA molecule is 
25 homologous to at least a portion of the double-stranded DNA molecule. In one aspect of this preferred embodiment, in 
thi: peptide: 

position 1 is asparagine; 
position 2 is glutamine 

position 3 is isoleucine, valine, tyrosine, threonine, leucine or methionine; 
3D position 4 is arginine 

position 5 is isoleucine, phenylalanine, valine, tyrosine, threonine, tryptophan, leucine, cysteine, 
methionine, histidine, arginine, glutamic acid, glutamine, serine, asparagine or alanine; 

position 6 is isoleucine, valine, threonine, leucine, methionine, histidine, arginine, lysine glutamine, 
serine, esparagine, alanine or glycine; 
35 position 7 is isoleucine, phenylalanine, valine, tyrosine, leucine, cysteine, or methionine; 

position 8 is valine, tyrosine, threonine, tryptophan, leucine, cysteine, methionine, histidine, arginine, 
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lysinE, glutamine, serine, ssparagine, alanine, proline or glycine; 

position 9 is isoleucine, phenylalanine, valine, tyrosine, threonine, tryptophan, leucine, methionine, 
histidine, arginine, lysine, aspartic acid, glutamic acid, glutamine, serine, alanine, proline or glycine; 

position 10 is phenylalanine, valine, tyrosine, threonine, tryptophan, leucine, cysteine, methionine, 
histidine, lysine, aspartic acid, glutamic acid, glutamine, serine, alanine, proline or glycine; 

position 1 1 is isoleucine, phenylalanine, valine, tyrosine, threonine, tryptophan, leucine, cysteine, 
methionine, histidine, lysine, glutamine, serine, asparagine, alanine, proline or glycine; 

position 12 is threonine, cysteine, methionine, histidine, lysine, glutamine, serine, asparagine or glycine; 

position 13 is threonine, cysteine, aspartic acid, serine, asparagine, alanine or proline; 

position 14 is isoleucine, phenylalanine, valine, tyrosine, threonine, tryptophan, leucine, cysteine, 
methbnine, histidine, arginine, lysine, aspartic acid, glutamic acid, glutamine, serine, asparagine, alanine, proline 
or glycine; 

position 15 is tryptophan or glutamic acid; ( 
position 16 is isoleucine, phenylalanine, valine, tyrosine, threonine, cysteine, methionine, aspartic acid, 
glutamine, serine, asparagine or alanine; 
position 17 is threonine; 

position 18 is valine, threonine, cysteine or serine; 
position 19 is glycine; and 
position 20 is glycine. 
According to another aspect of this preferred embodiment, in the peptide: 
position 1 is asparagine; 
position 2 is glutamine; 

position 3 is isoleucine, tyrosine, threonine, tryptophan, leucine, or methionine; 
position 4 is arginine; 

position 5 is isoleucine, phenylalanine, .valine, tyrosine, threonine, tryptophan, leucine, cysteine, 
methionine, histidine, glutamine, serine or asparagine; 
position 6 is arginine or lysine; 
position 7 is isoleucine, phenylalanine, leucine or methionine; 

position 8 is valine, tyrosine, threonine, leucine, methionine, histidine, arginine, lysine, glutamic acid, 
glutamine, serine, asparagine, alanine or glycine; 

position 9 is isoleucine, phenylalanine, valine, tyrosine, threonine, leucine, cysteine, methionine, 
histidine, glutamine, serine, asparagine or proline; 

position 10 is valine, tyrosine, threonine, tryptophan, cysteine, methionine, histidine, lysine, glutamine, 
serine, asparagine, alanine or glycine; 

position 11 is isoleucine, phenylalanine, valine, tyrosine, tryptophan, leucine, cysteine, methionine or 

histidine; 
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position 12 is cysteine, histidine, serine or alanine; 
position 13 is cysteine, serine or alanine; 

position 14 is isoleucine, phenylalanine, valine, tyrosine, threonine, leucine, cysteine, methionine, 
histidine, glutamine, serine, asparagine, alanine, proline or glycine; 
5 position 15 is tryptophan or glutamic acid; 

position 16 is isoleucine, valine, threonine, cysteine or asparagine; 
position 17 is threonine; 

position 18 is valine, threonine, cysteine, serine or proline; 
position 19 is glycine; and 
10 position 20 is glycine. 

Preferably, the peptide has the amino acid sequence shown in SEQ ID N0:1. Alternatively, the peptide has an 
amino acid sequence shown in SEQ ID N0:2 or SEQ ID NQ:3. 

Another embodiment of the invention is a method for selectively targeting a single-stranded DNA molecule to 
a double-stranded DNA molecule containing at least a portion of a sequence homologous to the single-stranded DNA 
15 mciecule, comprising contacting the single-stranded DNA molecule and the double-stranded DNA molecule in the presence 
of a peptide described above. Advantageously, this peptide has the amino acid sequence shown in SEQ ID N0:1. SEQ 
ID M0:2 or SEQ ID N0:3. According to another aspect of this preferred embodiment, the targeted DNA region of the 
double-stranded ONA molecule encodes a mutant form of a protein. 

The present invention also provides a method of inhibiting transcription of a specific gene sequence present on 
20 one strand of a double-stranded DNA molecule comprising contacting a peptide with the double-stranded DNA molecule 
anj with a single-stranded DNA molecule, which single-stranded DNA molecule is homologous to at least a portion of 
the gene sequence and hybridizes therewith to form a three-stranded molecule, wherein the peptide is a peptide described 
above. Preferably, the peptide has the amino acid sequence shown in SEQ ID N0:1, SEQ ID N0:2 or SEQ ID NO:3. 
Advantageously, the specific gene sequence on the double-stranded DNA molecule encodes a mutant form of a protein. 
25 Alternatively, the specific gene sequence on said double-stranded DNA molecule encodes an oncogene or viral transcript. 

Brief Description of the Figures 
Figure la illustrates the peptides used in DNA binding experiments. FECO is the naturally-occurring sequence 
of residues 193-212 of L coll RecA (SEQ ID N0:1). WECO (SEQ ID N0:2), YECO (SEQ ID N0:3), HECO (SEQ ID N0:4) 
and AECO (SEQ ID N0:5) contain, in place of phenylalanine at position 203, tryptophan, tyrosine, histidine and alanine, 
30 respectively. WT-14 is the naturally occurring sequence of residues 195-209 of RecA (amino acids 4-17 of SEQ ID 
NO;1l. WT Scr (SEQ ID N0:6) is a randomly scrambled sequence having the same amino acid composition as FECO. 

Figure 1b illustrates binding of the FECO, WECO, YECO and AECO to ssDNA (0.5 j/M) in the presence of 10 
mM MgC1 2 . The peptide concentration is shown on the x-axis and the percentage of ss DNA bound is shown on the 
y-sxis. 

35 Figure 1c illustrates binding of the FECO, WECO, YECO and AECO to dsDNA (1.0 pM) in the presence 

of 10 mM MgCI 2 . The peptide concentration is shown on the x-axts and the percentage of dsDNA bound is shown on 
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ths y-axis. 

Figure Id illustrates binding of the FECO, WECO and YECO to ssDNA (0.5 /;M) in the absence of MgCI 2 . 

Figure le illustrates binding of the FECO, WECO and YECO to dsDNA (1.0 ;/M) in the absence of 
MgCI 2 . The peptide concentration is shown on the x-axis and the percentage of rfsQNA bound is shown on the y-axis. 
5 Figure 2a illustrates a circular dichroism (CD) spectrum of a reaction mixture in which a 57-mer ssDNA molecule 

(CTGTCTACTCTCGAGGTTAACCCGTGCGAATTCTACGATTGGTGCGGCCGGTATATC;SEQ ID N0;7; 0.3 mM) was incubated 
at pH 7.5+/- 0.2 in the pressnce of 0.1 mM FECO ( o ) or with FECO at concentrations of 0.20 mM (OK 0.25 mM (□) 
and 0.40 mM U). 

Figure 2b illustrates a CD spectrum of 0.10 mM WECO h the presence or absence of 0.20 mM ssDNA (SEQ 
10 ID N0:7) at pH E.O +/-0.2. The inset shows the enlargement of the spectral region from 260-300 nm to emphasize the 
intensity change at 278 nm. 

Figure 2c illustrates a CD spectrum of 0.10 mM WECO in the presence (O) or absence (0| of 0.40 mM 
dsOHA from the Haelll digestion products of plasmid pUCIS at pH 6.0 +J-0.2. The inset shows the enlargement of the 
sp:-:tral region from 260-300 nm to emphasize lack of the intensity change at 278 nm. 
15 Figure 2d illustrates a CD spectrum of 0.40 mM HEC0 in the presence [O) end absence (O) of 0.4 mM ssDNA 

at pH 7.5+M3.2. 

Figure 3 summarizes the homologous recombination activity of 380 RecA loop L2 mutants in an in vivo 
recombination assay. The amino acid positions of the wild type L2 peptide are shown on top of the figure. 

Figure 4 shows the DNA repair activity of 380 RecA loop L2 mutants in in vhto UV and mitomycin C resistance 

20 assays. 

Figure 5a illustrates the dsDNA binding activity of mutant RecA203A protein. The peptide concentration is 
shown on the x-axis and the percentage of ssDNA bound is shown on the y-axis. 

Figure 5b illustrates the ssDNA binding activity of mutant RscA203A protein. The peptide concentration is 
she wn on the x-axis and thE percentage of ssDNA bound is shown on the y-axis. 
25 Figure 6 is a CD spectrum illustrating the conversion of peptide WECO from a random coil to ^-structure upon 

binding of the ATP analog ATP-f-S . 

Figure 7 illustrates that ATP-k*S binding leads to the oligomerization of WECO. WECO or WECO+ATP-k-S was 
applied to a gel filtration column and the absorbance of protein fractions at 280 nm was platted es a function of gel 
filtration retention time. 

30 Figure 8 is 5 CD spectrum illustrating that the appearance of ^-structure parallels WECO/ATP-k-S formation. 

The: circles indicate absorbance values at 280 nm and the diamonds indicate molar residue ellipticity. 

Figure 9 shows a surface plasma resonance profile which reflects the binding of WECO to ATP and ADP. 
Figure 10 is a surface plasma resonance profile showing the inhibition of WECO binding to ATP by 1 mM ATP. 
0.1 mM ssDNA and 1 mM ATP-k-S. Time is shown on the x-axis and response is shown in the y-axis. 
35 Figure 1 1 is a surface plasma resonance profile showing the inhibition of WECO bitding to ATP in the presence 

of massing concentrations of single-stranded DNA. Time is shown on the x-axis and response is shown in the y-axis. 
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Detailed Description of the Invention 
The sequence-specific targeting of DNA by RecA has important therapeutic applications including site-specific 
gene inactivation, correction of gene mutations and the control of gene expression. Other applications include sequence- 
spacific mapping and manipulation of complex genomes, particularly the human genome. 
5 Sequence-specific targeting of DNA is described U.S. Patent No. 5,460,941. This patent describes contacting 

ds DNA with a ss oligonucleotide probe complementary to a specific sequence in the presence of the entire £ coli RecA 
protein, resulting in triplex formation m vitro. 

However, the use of whole, intact RecA protein for specific targeting of DNA presents several drawbacks 
typically associated with whole proteins. These drawbacks include the difficulty of whole protein uptake by cells, 
10 delivery of the protein in sufficient concentration to cells to mediate an effect, generation ol a significant antibody 
response, susceptibility to biodegradation and the difficulty in modifying large proteins to increase their bioactivity. In 
addition, the production of recombinant protein is labor-intensive and expensive in comparison to the synthesis of small 
peptides. 

Thus, there is a need for a smaller, more stable molecule capable of being easily introduced into cells both in 

15 vitro and in vivo for targeting ds DNA in the genome, both for therapeutic and mapping/cloning applications. The present 
invention addresses this need. 

The present invention relates to the discovery that a twenty amino acid peptide spanning the ssDNA binding 
domain of the homologous recombination-promoting RecA protein binds to both ssDNA and dsDNA, promotes unstacking 
of the DNA base pairs, and, most importantly, selectively catalyzes the formation of joint ssDNA-dsDNA molecules at 

20 a rsgion of homology between the ssDNA and dsDNA. The single-stranded DNA binding domain of £ colt RecA is in a 
disordered mobile loop (L2) in the crystal structure {Gardner et al., Eur. J. Biochem., 233:419, 1995). It is quite 
umixpucted that such a short peptide is capable of carrying out the most unique and characteristic reaction mediated 
by the whole RecA protein: pairing of homologous single- and double-stranded DNA molecules and the formation of joint 
molecules. It is also quite surprising that these short peptides can mediate such complex reactions, as the native RecA 

25 protein has a molecular weight of about 20 times that of the peptide and it would be expected that additional regions 
of the protein beyond the 20 amino acids would be required for its activity. 

The short peptides of the present invention have the sequence of the £ or// single strand binding domain, or 
sequence variations thereof. As explained in detail hereinbslow, these peptides can selectively target a single-stranded 
oligonucleotide to a homologous sequence contained in a double-stranded DNA molecule. This selective targeting has 

30 therapeutic applications, including inactivation of deleterious genes and control of gene expression, and may be used to 
map complex genomes, including the human genome. 

The 20 amino acid peptide capable of promoting homologous DNA pairing corresponds to amino acids 193-212 
of native RecA (SEQ ID N0:1) and variations thereof that retain activity. A variety of such variations on the 20-mer 
can be made. The synthetic peptides and oligonucleotides described herein were made according to the techniques 

35 described below in Example 1. 
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Example 1 

Oligonucleotide and Peptide Synthesis 
Oligonucleotides were synthesized on an Applied Biosystems model 36DB synthesizer and purified by 
polyacrylamide gel electrophoresis (PAGE). The peptides used in the examples herein were made on an Applied 
5 Biosystems model 431 A synthesizer, purified by reverse phase high performance liquid chromatography (HPLCI on a C-18 
column and dissolved in 20 mM CAPS (3'[CycIohexylamino]-1-propane<suifontc acid; Sigma, St. Louis, MO), pH 10.6. 

The 14-mer, WT-14, described above in the brief description of Figure 1a, lacks the first three and last three 
amino acids of the active 20-mer. This 14-mer did not promote the formation of joint DNA molecules. As shown in 
Figures 3 and 4, 193N, 194Q, 196R, 21 IB and 212G are absolutely required for recombination and DNA repair events 
10 in cells. Thus, RecA-derived peptides having 20 amino acids or more, and including amino acids 193, 194, 196, 211 
and 212, are within the scope of the invention. Although longer peptides are included within the scope of the present 
invention, the preferred upper limit of the length of the peptide would be about 50 amino acids, i.e. the 20-mer plus 30 
additional residues of native RecA in combination on the N*terminal and/or C-terminal sides of the 20-mer. Peptides 
longer than about 50 amino acids would be more difficult to manipulate, more difficult to introduce into cells and subject 
15 to more of the drawbacks described above in connection with whole proteins, including degradation and immunological 
intolercnce. 

It will be appreciated that 20 amino acid peptides containing one or more amino acid substitutions in various 
positions of the sequence shown in SEQ ID N0:1 are also within the scope of the invention. Many amino acid 
substitutions can be made to the native 20-mer sequence and retain the recombination and repair-promoting activities 

20 of the peptide as shown in Figures 3 and 4. However, in the preferred embodiment of the invention, the asparagine, 
glutamhe, arginine, glycine and glycine residues at positions 193, 194, 196, 211, and 212 of native RecA, respectively, 
are present in the corresponding positions of the peptides of the invention (positions 1, 2, 4, 19 and 20], as these 
residues are essential for activity of RecA-Iike proteins. 

Variations cf SEQ 10 NO: 1 contemplated for use in the present invention include minor insertions, deletions 

25 or substitutions that do not substantially affect its ability to bind and unstack ONA and to catalyze the formation of joint 
ssD-JA-dsDNA molecules. For example, conservative amino acid replacements are contemplated. Such replacements are, 
for example, those that take place within a family of amino acids that are related in their side chains. The families of 
am no acids include the acidic amino acids {aspartic acid, glutamic acid); the basic amino acids (lysine, arginine, histidine]; 
the non-polar amino acids (alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan; and the 

30 uncharged polar amino acids (glycine, asparagine, glutamine, cysteine, serine threonine, tyrosine); and the aromatic amino 
acids (phenylalanine, tryptophan and tyrosine]. In particular, it is generally accepted that conservative amino acid 
replacements consisting of an isolated replacement of a leucine with an isoleucine or valine, or an aspartic acid with a 
glutamic acid, or a threonine with a serine, or a simitar conservative replacement of an amino acid with a structurally 
related amino acid, in an area outside of the polypeptide's active she will not have a major effect on the properties of 

35 the oolypeptide. 

in fact, any peptide derivative of SEQ ID N0:1, including conservative substitutions, non-conservative 
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substitutions, mixtures thereof, as well as truncated peptides and longer peptide sequences containing additional amino 
acids of native RecA or sequence variations thereof may be tested as described in the examples set forth hereinbeiow 
to determine their ability to bind ssDNA/dsDNA, unstack ssDNA snd catalyze joint ssDNA-dsDNA molecule formation. 
Such routine experimentation will allow the skilled artisan to easily screen any desired peptide. 
5 The peptides of the invention bind to both ssDNA and dsDNA m vitro, both in the presence and absence of 

magnesium ion as evidenced by a filter binding assay, as described below in Example 2. 



Example 2 

ssDfJA and dsDNA Binding assays 

10 dsDNA was prepared by annealing of 5' 32 P labeled oligonucleotide BS-S1 

(5'-3GCCGCTCTAGAACTAGTGGATCCCCCGGGCTGCAGGAATTCGATATCAAGCT-3';SEQ ID N0:B) and unlabeled BS-S2 
(5' AGCTTGATATCGAATTCCTGCAGCCCGGGGATCCACTAGTTCTAGAGCGGCC-3';SEQ 10 NO:9). BS-S1 spans positions 
742-620 in the poiylinker region of the plasmid Bluescript SK + (Stratagene, La Jolla, CA) and BS-S2 is the complement 
of E1S-S1. Binding Reactions were performed in a volume of 40 //I and contained ^P*labeled 0,5 fm ssDNA or 1 //m 

15 dsDNA (expressed as phosphate concentration), 40 mM Tris-borate, pK 7.5, 0 or 10 mM MgCI 2 , 20 mM NaCI, 10 j/g/ml 
BSA, 10 mM CAPS, pH 10.6 and peptide concentrations of up to 100 j/m; final pH value was B.3. After 3 30 minute 
incubation at room temperature, reaction mixtures were filtered using a double-filter system (Wong et a!.. Proc. Nati 
Acad. Sci. USA, 90:5428-5432, 1993) with BA85 nitrocellulose filters and NA45 DEAE membranes (Schleicher and 
Schuell. Keene, NH). Ait experiments were performed in at least triplicate. Data were quantitated with a phosphorimager 

20 (Molecular Dynamics). 

The results shown in Figures lb and 1c indicate that an aromatic amino acid (phenylalanine, tyrosine or 
tryptophan] is needed at amino acid position 203 of RecA (amino acid 11 of SEQ ID NOS: 14) for binding to both ssDNA 
ant; tisDNA. The sequence shown in SEQ ID N0:4 in which residue 11 is an stantne did not bind to either ssDNA or 
dsDiMA (Figure lb, c). Further, HECO, VVT-14 and WT-Scr did not bind to either ssDNA or dsDNA. WT-Scr contains the 

25 same amine acid composition as FECO, but in a scrambled sequence. The phenylalanine residue at position 203 is the 
most conserved interna! position among prokeryotic RecA proteins and their eukaryotic homology such as Dmcl and 
Rad51 (Story et al., Science, 259:1892*1895, 1993!. Although the peptides bind less tightly in the absence of 
magnesium ions than in their presence (Figures Id, e), the fact that 100-200 mM NaCl can substitute for 10 mM Mg 2+ 
indicates that this is not 8 specific divalent ion effect. As expected, ATP, ADP and ATPyS had no effect on binding 

30 of the peptides to DMA. 



The peptides promote unpacking of DMA as assessed by increased sensitivity of thymine residues to the 
modifying reagent potassium permanganate. The ability of the aromatic amino acid-containing peptides to unstack DNA 
was addressed es described in the following example. 

35 
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Example 3 

Assay for Unstacking Ability 
RecA is known to extend both ssDNA and dsONA by 50% (Koller et al., in Mechanisms of DMA Replication end 
Recombination, N. Cozzarelli, Ed., Alan R. Liss, Inc., New York, 723729, 19831. The unstacking of the ssDNA is 
5 proposed to be essential for RecA-mediated DNA triplex formation between ssDNA and ds DNA (Camerini-Otero et at, 
Ceil, 72:217-223, 1993). Unstacked bases are more accessible to modification by potassium permanganate (PP), an 
agent that attacks thymines in a direction perpendicular to the base plane (Hayatsu et al., Biochem. Biophys. Res. 
Commun., 29:556-561 r 1967). This modification is monitored by chemically-induced strand cleavage at the modified base 
(Maxam et al., Meth. EmymoL, 65:499-560, 19B0). 

10 Peptide DNA complexes were prepared as described in Example 2, in a total volume of either 100 or 140 pi. 

In the tetter case, 40 p\ was used for filter binding and the remainder treated with PP. To bind RecA protein to BS-S1, 
2.7 //m RscA and 0.5 //m 32 P-iabe!ed BS-S1 were incubated in a volume of 100 fA containing 25 mM Tris-Hcl, pH 7.5, 
10 mM MgCI 2 , 20 mM NaCl 0.4 mM dithiothreitol (DTT), 0.5 mM EDTA f 0.3 mM ATPpS, 1.1 mM ADP and 10 ^rg/ml 
8SA for 30 min. at 37°C. Complexes were incubated with 0.5 mM PP for 1 min. at room temperature. Reactions were 

15 terminated with DMS-stop solution (Maxam et al., ibid.) and precipitated with ethanol. After treatment with 1 M 
pyrolidone (95°C, 20 min.), the samples were separated on a polyacrylamide gel containing 20% urea (Sequagel, National 
Diagnostic). The gel was dried and exposed to x-ray film. 

As expected, the ss oligonucleotide was much more reactive to PP in the presence of RecA than in its absence, 
as i i jstra ted by tha higher intensities of the bands. Peptides FEC0 and WECO induced pronounced hypermodification 

20 of thymine residues in the ssDNA. As a result, little if any full size fragments remain and most of the radioactivity is 
found in very short oligonucleotides, suggesting multiple hits by PP. The reactivity of the ssDNA to PP was also 
increased in the presence of YECQ, but to a lesser extent than FECO and WECO. The shuffled non-DNA-binding WT-Scr 
peptide had no effect. These results indicate that as with intact RecA, WECO and FECO unstack ssDNA upon binding 
thereto. 

25 The extent of modification of the thymine residues in the ssDNA increased with FECO concentration in a manner 

which paraflsled the binding prcfile. Interestingly, although the binding of FECO to dsDNA was similar to that for ssDNA, 
no distinct modification of dsONA was observed using peptide concentrations between 0 and 25 pm. This suggests that 
the peptide uses different modes in binding to ssDNA versus dsDNA and that binding to dsDNA does not occur at single- 
stranded regions in the duplex. 

30 

The structure of the peptides themselves is changed upon binding to DNA as assessed by circular dichroism 
spectroscopy. In the absence of DNA, the peptides sssume a random-coil conformation. However, upon DNA binding, 
the peptides shift from random-coil structure to £-sheet. This conformational change appears important in binding to 
ssDfvA tnd dsDNA and unstacking the ssDNA. 

35 Because the structure of the DNA changed upon peptide binding, the ability of the peptides to change their 

secondary structure (conformation) upon binding to DNA was investigated using CD spectroscopy as described below. 
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ExamplB 4 

CD Spectroscopy of peptides and peptide-DNA complexes 
CD spectroscopy measures protein and nucleic acid secondary structure. All CD spectra were measured using 
a Jasco 720 spectropolarimeter. The instrument was calibrated with a 0.06% ammonium [+)-d 1D -camphorsuIphate 
5 solution which generated a CD intensity of 190.4 millidegrees (mdeg) at 290.4 nm. A baseline of 10 mM Naf^PC^ 
buffer, was subtracted out as background. Cells of 0.20, 0.50 or 1.00 mm pathlength were used and maintained at 
22+H°C. The response and band width for data collection were 2.0 sec and 1.0 nm. respectively. Spectra were 
smoothed using Savitsky Golay smooth and are presented at 2 nm interval and in molar residue ellipticity (degree'M* 
^cm* 1 ) or as raw data (mdeg). 

10 As shown. in Figure 2a, binding of FECO to ssONA induced a conformational transition from random coil to 

mostly ^-structure. While the dominant negative CD peak at approximately 198 nm at low peptide concentration 
indicated a random coil structure, the amplitude increase at both 190 nm and 215 nm at higher peptide concentrations 
is nh3racteristic of a concentration-dependent binding of a structure with a high ^-structure content on the DNA (Woody 
et al., in The Peptides, Analysis, Synthesis, Biohgy, V. Hruby, Ed., Academic Press, Inc., New York, 1985, pp. 15-114). 

15 Thii peptide remeins i random coil in the absence of DNA, even at higher peptide concentrations. The isodichroic point 
of 207 nm indicates that the data can best be described as a transition between two, and not more, states. 

A similar structural change is also evident in WECO (Figure 2b). However, WECO forms a more complete /?• 
structure than FECO under the same conditions, most likely due to its higher affinity for DNA. All other peptides tested 
remained predominantly random coil structures in the presence of ssDNA, as typified by the HECO CO profile (Figure 2d). 

20 In addition, the reduction in the 276 nm DNA band for the WECO-ssDNA complex (Figure 2b, inset) supports the chemical 
modification data showing that the ssDNA is unstacked upon complex formation. Binding to dsDNA also induces /? 
structure in WECO but, consistent with the chemical modification data, results in only minor changes in the 278 nm band 
(Figure 2c). 

The CD changes correlate well with the filter binding and chemical modification data, suggesting that CD and 
25 those methods are complementary and that the conformational change of the peptide from random coil to /? structure 
is important in binding to ssDNA and dsDNA and unstacking the ssDNA. 

Most importantly, the peptides of the invention promote joint molecule formation between ssDNA and dsDNA 
molecules having one or more regions of homology. The formation of joint molecules by the peptides does not require 
30 an external energy source, es nucleotide cofactors are unnecessary. This has also been demonstrated for RecA. It is 
contemplated that this joint molecule formation results in homologous DNA recombination and subsequent heteroduplex 
formation. 

The ability of the peptides of the invention to promote homologous pairing of DNA molecules will allow 
introduction of a nucleic acid sequence into a precise location in the human genome. In a preferred embodiment, the 
35 peptide and a ssONA sequence having homology to a particular sequence contained within the genome are introduced 
into particular cells in vivo. This may be accomplished, for example, by encapsulating the peptide and ssDNA in a 
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liposome lollowed by injection into the bloodstream. The liposomes will fuse with cells in vivo, resulting in intracellular 
delivery of peptide and ssDNA sequence. The ssDNA sequence will then pair wrth its corresponding genomic sequence, 
forming a ssDNA-dsDNA three-stranded molecule. Alternatively, the peptide and ssDNA sequence may be directly 
coinjected. Because the peptide is small there is reduced risk of a substantial immunological response compared to a 
5 large protein such as RecA which would elicit a substantial immune response. In addition, the peptide may be chemically 
modified by well known techniques to increase its stability. 

RecA can bring together any two homologous ONA molecules and promote their recombination. Although FECO 
end WECO both bind ssDNA and dsDNA, assume a well-defined structure as a rasult of this binding, and unstack ssDNA, 
it v/ould be unexpected that such short 20 amino acid peptides would selectively pair a ssDNA with a homologous 
10 dsDNA. This possibility was addressed in the following example. 

Example S 

Homologous ONA pairing by RecAderived peptides 
The formation of joint IssDNA-dsDNA) molecules was investigated by incubating 300 ng pBIuescript KS* 
(supercaileti plasmid) and 13 ng 32 P-BS-S1 with either 13.6 //m RecA or 50 im peptide under conditions used for RecA 
15 and peptide-ssONA formation described in Example 3, with the exception that reactions involving peptides were allowed 
to proceed for 90 min. at 45°C. The ratio between ss oligonucleotides and the target ds ONA sequence was 9-10 
oligonucleotides per dsDNA molecule for both the peptide and RecA reactions. Reactions were quenched by the addition 
of 2% sodium dodecyl sulfate (SDS) and 20 mM EDTA, followed by electrophoresis on a 1% agarose gel containing 5 
mM magnesium acetate and 0.5 //g/ml ethidium bromide for 4 hours in a cold room. The gel was dried and exposed 
20 to way film. 

Surprisingly, FECO and WECO catalyzed the formation of stable joint ssDNA*dsDNA(BS-S1lpBlue$cript) molecules 
similar to those formed by RecA. These joint molecules are detected as emigration of radiolabel with the plasmid on 
the agarose gel. We were able to recover from 10% to 20% as many joint molecules from reactions wrth these two 
peptides compared to the whole RecA protein. Mo joint molecules were formed when BS SI was replaced with a duplex 

25 having the seme sequence or when a target plasmid lacking a region homologous to BS-S1, pUC19, was used. YECO, 
which hinds poorly to DMA, and other peptides which do not bind to ONA, did not promote formation of joint ssDNA- 
dsDNA molecules. As mentioned hereinabove, these peptides can form homology-dependent stable joint molecules in the 
absence of a nucleotide cofactor. Thus, this experiment illustrates the selective targeting of a ssDNA molecule to its 
complementary region on a dsDNA molecule. 

30 The fact that DNA binding peptides derived from RecA are able to form joint molecules between homologous 

DNAs indicates that this domain comprises at least part of the active site of the whole protein responsible for DNA 
pairing. That the reaction proceeds efficiently in the absence of the remainder of the protein and in the absence of 
nucleotide cofactors or their analogs suggests that a major role of the remainder of the protein, in addition to its other 
biochemical activities, is to modulate access to this pairing domain in a nucleotide cof actor-dependent manner. 

35 

To determine how the local protein structures in L2 relate to function, saturation mutagenesis of the L2 region 
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was performed by introducing all possible amino acid substitutions at all 20 positions resulting in 380 RecA mutants. 
Saturation mutagenesis is described in the following example. 

Example 6 
Saturation mutagenesis of RecA L2 region 
5 Saturation mutagenesis was performed according to the Bio-Had Muta-Gene Phagemid in vitro mutagenesis 

instruction manual [Bio-Rad, Hercules, CA). Briefly, wild type RecA cDNA was inserted into the pBluescript SK + vector 
which can exist in both ssDNA and dsDNA forms and ss ONA was isolated. Synthetic complementary oligonucleotides 
carrying the desired mutation were annealed to the ssDNA and used as primers for complementary strand synthesis. 
£ ccfi Xl-1 blue were transformed with the synthesized products. Cells were plated on ampicillin agarose plates and 
10 incubated at 37°C overnight. Single colonies were picked, cultured overnight and their DNA extracted to verify the 
identity of each mutant by DNA sequencing using the SEDUEfJASE™ Version 2.0 sequencing kit (United States 
Bic chemical Corp., Cleveland, OH). 

The activity of each RecA mutant was determined using three in vivo assays which assess the ability of RecA 
to promote homologous recombination and recombinations! repair of recombinant DNA. In the first assay the RecA 
15 mutants were tested for their ability to support plaque formation by the defective reef gam' A phage in a A 
complementation assay scoring for the recombinational activity of RecA. 

in the second and third assays, the RecA mutants were tested for their resistance to treatment with UV light 
or mitomycin £ both of which induce DNA damage and activate the SOS recombinational repair system, scoring for the 
recwnblnational repair. These assays are described below. 
20 % 

Example 7 

A complementation recombination assay 
This assay is based on the necessity of bacteriophage A to form concatemers of its genome before processing, 
packag : ng and lysis of the host £ coli can occur. Concatemer formation of the A genome in £ coli can proceed by 
25 utilizing the A red or gam gene products, or the host RecA protein. The red gene product and RecA protein can 
recombine the A genome, leading to concatemer formation. The gam gene product promotes rolling circle replication of 
the phage genome, promoting concatemer formation by binding directly to RecBCD end inhibiting its exonuclease activity. 
The RecA cDNA and mutants thereof were expressed from a plasmid vector in a RecA', wild type RecBCD £ a?// host 
XL-1 Blue cells. The ability of the expressed RecA, or mutants thereof, to allow a AredgamA phage to form plaques 
30 was determined relative to wild type RecA (Figure 3). 



Example 8 

Recombinational repair activity assays 
Cell cultures of each RecA mutant were grown to en 00 600 of 0.5 and diluted with culture media to an OD B qq 
35 of O.OE. Three //I of each dilution was spotted onto agarose plates containing ampicillin for the UV resistance assay 
and on agarose plates containing ampicillin and mitomycin C for the mitomycin C resistance assay, respectively. For the 



WO 96/40765 PCT/US96/09959 

12- 

UV resistance assay, the plates were exposed to different dosages of UV light {0, OA 0.8, 1.2, 1.8. 3.0 mJl using a 
"Stratalinker 1800" (available from Strategene, Inc., La Jolia, California). For the mitomycin C resistance assay, different 
concentrations of mitomycin C (0.1, 0.2, 0.3 //g/ml) were used. After incubation of plates overnight at 37°C, culture 
spots were evaluated. The degree of RecA resistance to UV and mitomycin C exposure is proportional to recovery of 
5 the culture spot compared to the positive and negative controls. For both assays, wild type RecA in pBluescript and 
pBluescript both in XM blue served as positive and negative controls, respectively. The resistance of each mutant 
relative to wild type RecA is shown in Figure 4. 



The results of these in vivo assays indicate that while the highly conserved amino acids 193N, 194Q, 196R, 
10 209T, 21 1 G r and 21 2G are absolutely required for recombination events in cells, other highly conserved amino acids in 
RecA proteins (198K and 203F) are much less stringently required. For the less conserved amino acids 1951, 1991, 
204 3, and 205N, only few substitutions can be tolerated which will still result in recombinational and repair activity. 
In positions 197M, 200G, 201V, 202M, 206P and 20BT, the majority of substitutions are well tolerated, only minimally 
affBCting recombinational and repair activity of the mutants. The mutants 200G, 20QS. 2WH, 20ST, 206R, 20SS, 206A 
15 and 208T were more active than wild type RecA, whereas 204A and 208L may be dominant negative mutants, based 
on Their ability to prevent recombinational activity of wild type A bacteriophage, as described below in Example 9. 

Example 9 

Prevention of Recombinational Activity Assays 
The prevention of recombinational activity of wild type A bacteriophage was determined in an in vivo assay in 

20 which we studied the loop 2 point mutants of RecA. This assay is based on the requirement of bacteriophages to form 
consumers of its genome before processing, packaging and lysis of its £ cofi host can occur. Concatamer formation 
of the A genome in £ cofi can proceed via three pathways: utilization of the A red gene product, the A gam gene 
product, or the host RecA protein. The A red gene product and £ cofi RecA protein can recombine the A genome, 
leading to concatamers. The A gam gene product allows the rolling circle replication of the phage genome, resulting in 

25 concatamers, by binding directly to RecBCD and inhibiting its exonuclease activity. The RecA cDNA, and mutants 
thereof, was expressed from a piasmid vetor in a RecA deficient wild type RecBCD E.coli host. The ability of the 
expressed RecA, or mutants thereof, to allow a LrodgamA phage to form plaques was than determined. As a control, 
the mutant subclones were also infected with the wild type A phage, Alk? (Stratagene), which has functional red and 
gam genes. Mutant subclones 204A and 208L were lysed poorly or not at all by SZAP. Thus, these RecA mutants 

30 appear to interfere with either the red, gam or both gene products. Thus, while not wishing to be bound by any 
particular interpretation of these results, we believe these mutants are dominant negative mutants of RecA. 



In general, the recombinational repair assay was more stringent than the recombinational assay. Treatment of 
cells with mutagens induces the SOS response, initiated by the RecAmediated cleavage of the LexA repressor protein. 
35 Differences in behavior of the various RecA mutants also indicates that the L2 region is involved in the interaction of 
RecA with LsxA. To determine whether mutants active in the recombinational assay but inactive in recombinational repair 
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are unable to cleave LexA, the RecA203A mutant was purified as described in the following example. This mutant was 
completely active in the recombinational assay, hut inactive in recombinations! repair {Figures 3 and 4). 

Example 10 
Purification of RecA mutants 

5 The recA203A gene was subcloned into the pET9 expression vector (Novagen) and used to transform BLRIDE3) 

cells containing the pLysS plasmid. (These cells are a derivative of the commercially available BL2KDE3) cells (Novagen) 
with the wild type recA deleted}. A 500 ml culture of BLR(DE3)lpLysS, pET9-203A cells were grown to an 0D 600 of 
0.7, and RecA203A expression was induced by the addition of isopropylthio-^-O-galactoside(IPTG). Cells were grown 
for 34 hours then harvested by centrifugation. All of the following procedures were performed at 4°C unless stated 

10 otherwise. The cell pellet was resuspended in 70 ml 50 mM Tris-HCI, pH 8.0, 2.0 mM EDTA. Cells were lysed by 
sonication and cell debris wes removed by centrifugation at 18,000 rpm for 1 hour. A 10% solution of polymin P, pH 
7.8 was added to the cleared lysate dropwise to a final concentration of 0.5%. The solution was stirred for 30 min, 
and the protein precipitate was collected by centrifugation at 12,000 rpm for 30 min. The pellet was homogenized in 
10 mM Tris-HCI, pH 7.5, 150 mM (NH 4 ) 2 S0 4 , 10% (v/v) glycerol, 1 mM EDTA, 1 mM DTT. The precipitate was 

15 collected by centrifugation at 17,000 x g for 30 min, then resuspended in homogenization buffer containing 300 mM 
INH-I^SD^. The protein solution was loaded directly onto a MonoQ column (Pharmacia) previously equilibrated with 0 
buffer (20 mM Tris-HCI, pH 7.5, 1 mM EDTA). Protein was etuted in Q buffer with a linear salt gradient of 0-1.0 M 
NaCI. Fractions were analysed by SDS PAGE. RecA-containing fractions were pooled, dialyzed into 20 mM Tris-HCI, 
pH 7.5. 10% glycerol, 1 mM DTT, 0.1 mM EDTA and stored at -80 e C. 

20 The biochemical activity of the RecA203A was tested in in vitro binding reactions as described in the previous 

examples. RecA203A was almost indistinguishable from wild type protein in binding to dsDNA (Fig. 5A), but bound to 
ssDNA with a slightly lower affinity than wild type RecA (Fig. 5B). However, the mutant protein was entirely active 
in synaptic complex formation and strand exchange reactions. Moreover, the rate of RecA203A~mediated strand exchange 
was higher than wild type RecA. Thus, RecA203A is proficient in both in vivo and in vitro recombinational assays. 

25 RecA203A, wild type RecA and RecA203H were assayed for LexA cleavage m vitro. RecA203H, exhibiting a 

wild type phenotype in both in vivo assays, displayed wild type protein levels of LexA cleavage activity. In contrast 
RecA 203A never reached wild type RecA levels of LexA cleavage, even under optimal conditions in the presence of 10 
fold excess ssDNA. These results indicate that the L2 region of the RecA protein plays an important role in initiation 
of ^combinational repair. 

30 _ n 

Because RecA is a DNA-dependent ATPase and ATP-dependem DNA binding protein, CD experiments were 
performed to determine whether L2 is the binding site of the y-phosphate group of nucleotide cofactors. 

Example 11 

Binding of nucleotide cofactors to WECO peptide 
35 When increasing concentrations of ATP-k-S were incubated with 50 //M WECO, a concentration-dependent 

increase in the formation of ^-structure was observed (Figure 6). Under the same buffer conditions, incubation of WECO 
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wiih ADP and pyrophosphate resulted in a minimal change. ATP, GTP, CTP, tripolyphosphate and tetrapolyphosphate 
can also induce such a shift to ^-structure. The binding of ATP-y-S to WECO was atso shown by its exclusion from 
a gel filtration column (Smart System, Pharmacia) after being fixed with 0.2% glutaraldehyde. 

Based on miniaturization of liquid chromatography, this system enables the purification and secure recovery of 
5 low microgram and nanogram levels of material A SUPERDEX™ 75 PC 3.2/30 column was equilibrated with 20 mM 
sodium cacodylate, pH 7.6, 20 mM NaCL WECO peptide in water (0.1 mM final concentration) was heated at 65°C 
for 15 min, cooled to room temperaturs and mixed with 20 mM sodium cacodylate, pH 7.6, 20 mM NaCI; ATP-f-S and 
glutaraldehyde (0.2% final concentration). The mixture was incubated at 37°C for 5 min prior to injection into the 
column (injection volume, 20 pi; flow rate, 100 //1/min). Eluted peaks were monitored at 280, 260 and 212 nm. The 
10 column profile is shown in Figure 7. Further, as shown in Figure 8, the increase in ^-structure parallels formation of 
the WECO-ATP-jaS complex. 



To prove that the peptide structural change is due to binding of nucleotide cof actors, Surface Plasma Resonance 
(SP3I was performed as described below. 

15 Example 12 

Surface Plasma Resonance Assay 
The SPR assay involves the use of biosensors. Real-time Biomolecular Interaction Analysis (BIA) uses SPR to 
monitor bicmotecular interactions between an immobilized ligand and a flow-through analyte. Detection depends on 
changes in the mass concentration of macromolecules at the biospecific interface. SPR is an optical phenomenon arising 

20 in thin metal films under conditions of total internal reflection and produces a sharp dip in the intensity of reflected light 
at v. specific angle, the resonance angle. One interactant is immobilized on the sensor surface which forms one wall of 
a micro-flow cell. Solution containing the other mtEractant(s) flows continuously over the sensor surface, As molecules 
from solution bind to the immobilized interactant, the resonance angle changes and a response is registered. Details 
regarding this procedure may be found in the BIAtechnoIogy handbook (Pharmacia). 

2E SPR was performed using the Pharmacia BIA system (Pharmacia Biosensor AB) at a flow rate of 10 pljmin. 

The surface of a micro-flow cell was activated with l-ethyl-3-(3-dimethylamincpropyl) carbodiimide (EDC)fN- 
hycroxysuccinimide (NHS) for 7 minutes, then incubated with ATP or ADP in 50 mM Tris-HCI, pH 8.3, 20 mM NaCI for 
6 minutes to allow coupling of these compounds. The surface was then deactivated with ethanolamine. The running 
buffer was 50 mM NaHP0 4 , pH 8.5, 100 mM NaCI and the binding buffer was 50 mM NaHP0 4 , pH 8.5, 100*150 mM 

30 NaCI. The results are summarized in Figures 9-11. As can be seen in Fig. 9, WECO bound to an ATP-derivatized 
substrate; however, WECO did not bind to an ADP-derivatized or blank substrate. Figure 10 shows the inhibition of 
WECO binding to ATP in the presence of various competitors: ATP, ssDNA and ATP-y-S. Figure 11 shows the decrease 
in binding of WECO to ssDNA in the presence of increasing concentrations of ssDNA. 

35 The ONA targeting technique described herein is also valuable in correcting a mutant gene sequence 

characteristic of many genetic disorders. Examples of genetic disorders believed correctable through this technique include 
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sickle cell anemia, ^-thalassemia, c/stic fibrosis, muscular dystrophy and the like. The complementary {wild type) ssONA 
sequence will contain the correct nucleotide)*) at the positions) in the mutant dsDNA sequence which contains the 
mutation. The wild type sequence will then complement the mutation, resulting n expression of a normal protein. 

This technique will also allow the modulation of gene expression in a fashion similar to triplex approaches for 
5 antigene therapies. In these triplex therapies, the introduced oligonucleotide binds to the coding strand of a particular 
gene sequence, inhibiting its transcription into mRNA in the nucleus. Similarly, the ssDNA molecule may be targeted by 
ths peptides of the invention to a particular gene sequence for three stranded DNA formation and inhibition of 
transcription. In addition, gene sequences which encode regulatory proteins which either stimulate or inhibit the 
transcription of other genes may be targeted using the peptides of the invention. For example, if a gene encodes a 

10 product which stimulates transcription of an undesirable gene, the gene encoding the product may be targeted for three- 
stranded DNA formation in vivo. 

The peptides of the invention may be used to promote inhibition of transcription of oncogenes and deleterious 
virnl transcription products by promoting joint molecule formation between these genes and ssDNA molecules having 
homology to regions thereof. For example, the pi go BCR * AtaI gene occurring in chronic myelogenous leukemia (CML) or 

15 acute lymphoblastic leukemia (ALL) may be targeted with a ssDNA complementary thereto, resuming in three-stranded 
DNA formation and inhibition of its transcription. The ras oncogene is known to be present in a variety of human tumor 
typas and may play a major role in tumorigenesis. This oncogene may be targeted with its complementary sequence, 
or z fragment thereof, using the instant peptides. Similarly, vital transcription products encoded by many problematic 
human viruses could also be targeted, including the reverse transcriptase gene of retroviruses and nucleocapsid proteins 

20 of hepatitis virus, rabies virus or any other desired virus. 

In another preferred embodiment, ceils are removed from a patient and transacted ex vivo with a peptide of 
the invention in combination with a desired wild-type ssDNA having homology to a region of a mutant gene sequence(s). 
Th~ introduction of this wild-type sequence will result in the formation of a three-stranded DNA molecule at the region 
of homology between the ssDNA and ds DMA, thus complementing the mutant sequence(s), resulting in production of a 

25 functional gene product. Transfection may be performed using any of a variety of well known techniques including, but 
not limited to, electroporation, calcium phosphate DNA precipitation and OEAE-dextran. The cells are then expanded in 
culture, preferably in the presence of a growth factor or cytokine, prior to reproduction into the patient. This technique 
is particularly useful in genetic abnormalities expressed in blood cells, such as sickle cell anemia, thalassemias, and the 
like. Howaver, the technique can also be used to correct other genetic abnormalities through reintroduction of cells 

30 N bearing the wild type gene or gene fragment, in e particularly preferred embodiment, the wild type gene is introduced 
into bone marrow stem cells for the treatment of blood cell genetic abnormalities. The desired cells are then isolated 
from the transfected mixture and reintroduced into the marrow of the patient. 

Examples of the use of peptides of the invention to promote DNA pairing to complement a genetic defect are 
described below in Examples 13 and 14. 

35 
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Example 13 

Treatment of sickle ceil anemia 
Sickle cell anemia is a genetic disease characterized by cardiac enlargement, swelling ci the lymph nodes and 
anemia. This disease is caused by a point mutation at position 6 of the/? chain of hemoglobin in which a valine replaces 
5 the normally-occurring glutamate, markedly reducing the solubility of deoxygenated hemoglobin and resulting in sickling 
or polymerization of hemoglobin leading to sickling of red blood cells which blocks blood vessels, leading to local regions 
of low oxygen concentrations. 

Red blood cell precursors cells ere synthesized in the bone marrow and are responsive to the hormone 
erythropoietin (EPO). Bone marrow is isolated from a patient and the cells contained therein are cultured in a semi-solid 
10 matrix in the presence of EPO. After several days, colonies of about 60 erythrocytes appear, each founded by a single 
committed erythroid progenitor cell (Alberts et al., Molecular Bhiogy of the Cell, Second Edition, Garland Publishing, Inc., 
new York, p. 980). This cell is known as an erythrocyte colony-forming cell or CFC-E, and it gives rise to mature 
erythrocytes after about six division cycles or less. The CFC-E cells are isolated and cotransfected by electroporation 
with the peptide shown in SEQ ID NO:1 and a portion of a wild type ssDNA strand encoding the ^ chain of hemoglobin 
15 which contains glutamate at position 6. The transfected cells are then injected back into the patient. The patient is 
then monitored for diminishing symptoms of the disease. 



Example 14 

Treatment of Duchenne Muscular Dystrophy (DMD) 

20 Duchenne Muscular dystrophy is a degenerative muscle wasting disease caused by mutations in the dystrophin 

gene. Patients with muscular dystrophy are given multiple injections of a preparation containing the functional dystrophin 
gens coding strand (Koenig et aL, Cell, 53:219-226, 1988) and the peptide shown in SEQ ID N0:1. While under light 
anesthasia the patients are injected at 5 cm intervals into the entire skeletal muscle mass directly through the skin 
without surgery. Patient recovery is monitored by monitoring twitch tension and maximum voluntary muscle contraction. 

25 _____ 

cpk ma i ■!■ uac= 

Example 15 

Inhibition of p190 BCR ABL transcription in vivo 
In patients with acute lymphoblastic leukemia (ALU, a reciprocal chromosomal translocation t(9:22) (q34:q11) 
results in a truncated chromosome 22, encoding at the breakpoint a fusion of sequences from the c-ABL protooncogene 
3Q (Bertram, et al. Nature, 306:277-280, 1383) and the BCR gene (Groffen et al., Celt. 36:93-99, 1984). In ALL, the 
breskpoims usually occur in the first introns of both BCR and c-ABL, resulting in a p!90 BCR * ABL gene product (Kurzrock 
et al., tortra, 326:631-635, 1987). 

White blood cells are isolated from patients with ALL. These cells are cotransfected by electroporation with 

pro AOI 

the peptide shown in SEQ ID NO: 1 and a ssDNA complementary to the transcription initiation site of the p190 Dbr, ' HDL 
35 gene. The cells are expanded in culture, preferably in the presence of a cytokine or growth factor, then reintroduced 
into the patient. The patient is monitored over time for diminishing symptoms of the disease. 
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The gbility to stimulate homologous recombination with well-defined short peptides of approximately 20 amino 
acids will allow the delivery of these peptides into cells in large quantities and with great facility, a task virtually 
impassible with whole protein molecules. These peptides may also be chemically modified using well known techniques 
5 to increase their stability and bioavailability, i.e., methylation, acylation, t-butylation and the like. 

Another important use of the peptides of the invention is in the mapping and cloning of large complex genomes. 
The use of whole RecA for site-specific cleavage of DNA is currently being used « vitro for the mapping and cloning 
of the human genome and is discussed in U.S. Patent No. 5,460,941. Similarly, the peptides of the invention may also 
be used for site-specific cleavage of DNA to facilitate the mapping end cloning of complex genomes. 

10 An additional use of the instant peptides is in the generation of transgenic animals and other organisms. A 

particular application would be in the development of transgenic mice. Mutations are generated in embryonic stem (ESI 
cells by homologous recombination between exogenously added DNA and the endogenous chromosomal sequences 
(Mansour, GATA, 7:219-227, 19901. Homologous recombination allows precise targeted insertion of genetic information, 
correction of gene mutations and gene inactivation; however, this homologous recombination is a rare, inefficient event. 

15 These cells ere then used to generate chimeric intermediates that pass the mutant allele through the germ line, initiating 
a strain of mice that carry the desired mutation. This well known technique is disclosed in U.S. Patent 4,736,866. 
Using this method, an extremely large number of transformed ES cells must be screened to determine which cell contains 
the exogenously added DNA. The frequency of precise DNA stable integration at the homologous DNA locus will be 
greatly increased by cotransfection of the instant peptides and the ssONA of interest. The co-transfection of an 

20 exoyenously added DNA molecule in the presence of the claimed peptides into ES cells will improve the speed and 
efficiency of the homologous recombination between the two DNA sequences, resulting in larger numbers of transformed 
ES ceils containing the mutation, thus significantly reducing the number of cells to be screened. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION : 

(i) APPLICANT: GOVERNMENT OF THE UNITED STATES OF AMERICA AS 
REPRESENTED BY THE SECRETARY, DEPARTMENT OF HEALTH AND HUMAN SERVICES 

(ii) TITLE OF INVENTION : Promotion of Homologous DNA Pairing by 
RecA-derived Peptides 

(iii) NUMBER OF SEQUENCES: 9 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Knobbe , Martens, Olson and Bear 

(B) STREET: 620 Newport Center Drive, Sixteenth Floor 

(C) CITY: Newport Beach 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP : 92660 

(v) COMPUTER READA3LE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: U.S. 08/483,115 

(B) FILING DATE: 07-JUN-95 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Altman, Daniel E. 

(B) REGISTRATION NUMBER: 34,115 

(C) REFERENCE/DOCKET NUMBER : NIH115 . 001QPC 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (714) 760-0404 

(B) TELEFAX: (714) 760-9502 



(2) INFORMATION FOR SEQ ID NO : 1 : 

ii! SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE : NO 

(v) FRAGMENT TYPE: internal 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: FECO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



WO 96/40765 



•19- 



PCT/US96/09959 



Asn Gin lie Arg Met Lys lie Gly Val Met Phe Gly Asn Pro Glu Thr 
1 5 10 15 

Thr Thr Gly Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C} STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 



(vii) IMMEDIATE SOURCE : 
(B) CLONE: WECO 



Jxi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Asn Gin lie Arg Met Lys lie Gly Val Met Trp Gly Asn Pro Glu Thr 
15 10 15 

Thr Thr Gly Gly 
20 

(2) INFORMATION FOR SEQ ID NO:3: 

!i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii; MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(v) FRAGMENT TYPE : internal 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: YECO 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

Asn Gin He Arg Met Lys He Gly Val Met Tyr Gly Asn Pro Glu Thr 
15 10 15 

Thr Thr Gly Gly 

20 

(2) INFORMATION FOR SEQ ID NO:4: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: HECO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Asn Gin lie Arg Met Lys lie Gly Val Met His Gly Asn Pro Glu Thr 
15 10 15 

Thr Thr Gly Gly 
20 

(2) INFORMATION FOR SEQ ID NO : 5 : 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
{iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 
(v) FRAGMENT TYPE: internal 



■vii) IMMEDIATE SOURCE: 
(B) CLONE: AECO 



(>:i) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Asn Gin He Arg Met Lys He Gly Val Met Ala Gly Asn Pro Glu Thr 
1 5 10 15 

Thr Thr Gly Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: peptide 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
(v) FRAGMENT TYPE: internal 

(via) IMMEDIATE SOURCE: 
(B) CLONE: WT-14 
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(xi) SEQUENCE DESCRIPTION : SEO ID NO: 6: 

Arg Met Lys lie Gly Val Met Phe Gly Asn Pro Glu Thr Thr 
1 5 10 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv} ANTI-SENSE: NO 

(vj FRAGMENT TYPE: internal 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: WT-Scr 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

lie Pro Glu Gin Thr Lys Gly Gly Arg Asn Thr Met Asn Val Phe Gly 
15 10 15 

Met Gly He Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



(vii) IMMEDIATE SOURCE : 
(B) CLONE: 57-mer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:B: 
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CTGTCTACTC TCGAGGTTAA CCCGTGCGAA TTCTACGATT GGTGCGGCCG GTATATC 57 

(2) INFORMATION FOR SEQ ID NO: 9; 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL : NO 
(iv) ANTI- SENSE: NO 



(vii) IMMEDIATE SOURCE: 
(3) CLONE: BS-S1 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
GGCCGCTCTA GAACTAGTGG ATCCCCCGGG CTGCAGGAAT TCGATATCAA GCT 53 
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WHAT IS CLAIMED IS : 

1. A peptide containing the amino acid sequence shown in SEQ ID N0:1, or conservative amino acid 
substitutions thereof, wherein the amino acids at positions 1, 2, 4, 19 and 20 of SEQ ID NO: 1 are asparagine, 
glutamine, arginine, glycine and glycine, respectively, said peptide being capable of promoting paring of a single-stranded 
ffldA molecule and a double-stranded DMA molecule, wherein sard single-stranded ONA molecule is homologous to at least 
a portion of said double-stranded DNA molecule. 

2. The peptide of Claim 1, wherein: 
position 1 is asparagine; 
position 2 is glutamine 

10 position 3 is isoleucine, valine, tyrosine, threonine, leucine or methionine; 

position 4 is arginine 

position 5 is isoleucine, phenylalanine, valine, tyrosine, threonine, tryptophan, leucine, cysteine, 
methionine, histidine, arginine, glutamic acid, glutamine, serine, asparagine or alanine; 

position 6 is isoleucine, valine, threonine, leucine, methionine, histidine, arginine, lysine glutamine, 
15 serine, asparagine, alanine or glycine; 

position 7 is isoleucine, phenylalanine, valine, tyrosine, leucine, cysteine, or methionine; 

position 6 is valine, tyrosine, threonine, tryptophan, leucine, cysteine, methionine, histidine, arginine, 
lysine, glutamine, serine, asparagine, alanine, proline or glycine; 

position 9 is isoleucine, phenylalanine, valine, tyrosine, threonine, tryptophan, leucine, methionine, 
20 histidine, arginine, lysine, aspartic acid, glutamic acid, glutamine, serine, alanine, proline or glycine; 

position 10 is phenylalanine, valine, tyrosine, threonine, tryptophan, leucine, cysteine, methionine, 
histidine, lysine, aspartic acid, glutamic acid, glutamine, serine, alanine, proline or glycine; 

position 11 is isoleucine, phenylalanine, valine, tyrosine, threonine, tryptophan, leucine, cysteine, 
methionine, histidine, lysine, glutamine, serine, asparagine, alanine, proline or glycine; 
25 position 12 is threonine, cysteine, methionine, histidine, lysine, glutamine, serine, asparagine or glycine; 

position 13 is 1h re on me, cysteine, aspartic acid, serine, asparagine, alanine or proline; 

position 14 is isoleucine, phenylalanine, valine, tyrosine, threonine, tryptophan, leucine, cysteine, 
methionine, histidine, arginine, lysine, aspartic acid, glutamic acid, glutamine, serine, asparagine, alanine, proline 
or glycine; 

3 0 position 15 is tryptophan or glutamic acid; 

position 16 is isoleucine. phenylalanine, valine, tyrosine, threonine, cysteine, methionine, aspartic acid, 
glutamine, serine, asparagine or alanine; 
position 17 is threonine; 

position IB is valine, threonine, cysteine or serine; 
35 position 19 is glycine; and 

position 20 is glycine. 
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3. The peptide of Claim 1, wherein: 
position 1 is asparagine; 
position 2 is glutamine; 

position 3 is isoleucine, tyrosine, threonine, tryptophan, leucine, or methionine; 
5 position 4 is arginine; 

position 5 is isoleucine, phenylalanine, valine, tyrosine, threonine, tryptophan, leucine, cysteine, 
methionine, histidine. glutamine, serine or asparagine; 
position 6 is arginine or lysine; 
position 7 is isoleucine, phenylalanine, leucine or methionine; 
1C position 8 is vaine, tyrosine, threonine, leucine, methionine, histidine, arginine, lysine, glutamic acid, 

glutamine, serine, asparagine, alanine or glycine; 

position 9 is isoleucine, phenylalanine, valine, tyrosine, threonine, leucine, cysteine, methionine, 
histidine. glutamine, serine, asparagine or proline; 

position 10 is valine, tyrosine, threonine, tryptophan, cysteine, methionine, histidine, lysine, glutamine, 
15 serine, asparagine, alanine or glycine; 

position 11 is isoleucine, phenylalanine, valine, tyrosine, tryptophan, leucine, cysteine, methionino or 

histidine; 

position 12 is cysteine, histidine, serine or alanine; 

position 13 is cysteine, serine or alanine; 
20 position 14 is isoleucine, phenylalanine, valine, tyrosine, threonine, leucine, cysteine, methionine, 

histidine, glutamine, serine, asparagine, alanine, proline or glycine; 

position 15 is tryptophan or glutamic acid; 

position 16 is isoleucine, valine, threonine, cysteine or asparagine; 

position 17 is threonine; 
25 position 18 is valine, threonine, cysteine, serine or proline; 

position 19 is glycine; and 

position 20 is glycine. 

4. The peptide of Claim 1, wherein said peptide has the amino acid sequence shown in SEQ 10 NO: 1. 

5. The peptide of Claim 1, wherein said peptide has an amino acid sequence selected from the group 
insisting of SEQ ID NO: 2 and SEQ ID NO: 3. 

6. A method for selectively targeting a single-stranded DNA molecule to a double-stranded DNA molecule 
containing at least a portion of a sequence homologous to said single-stranded DNA molecule, comprising contacting said 
single-stranded DNA molecule and said double-stranded DNA molecule in the presence of a peptide according to Claim 
1. 

3 5 7. The method of Claim 6, wherein said peptide has an amino acid sequence selected from the group 
consisting of SEQ ID N0:1, SEQ ID N0:2 and SEQ ID N0:3. 
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8. The method of Claim 6, wherein said the targeted DNA region of said double-stranded DNA molecule 
encodes a mutant form of a protein. 

9. A method of inhibiting transcription of a specific gene sequence present on one strand of a double- 
stranded ONA molecule comprising contacting 3 peptide with said double-stranded DNA molecule and with a single- 
branded DNA molecule, which single-stranded DNA molecule is homologous to at least a portion of said gene sequence 
and hybridizes therewith, wherein said peptide is a peptide according to Claim 1. 

10. The method of Claim 9, wherein said peptide has an amino acid sequence selected from the group 
consisting of SEQ ID N0:1, SEQ ID N0:2 and SEQ ID N0:3. 

11. The method of Claim 9, wherein the specific gene sequence on said double-stranded DNA molecule 
lB&codes a mutant form of a protein. 

12. The method of Claim 9, wherein the specific gene sequence on said double-stranded DNA molecule 
encodes an oncogene. 

13. The method of Claim 9, wherein the specific gene sequence on said double-stranded DNA molecule 
encodes a viral transcript. 

15 14. The peptide of any one of Claims 1-6, for use in medical treatment 
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FECO NglRMKIGVMFGNPETTTGG 

WECO NQIRMKIGVMWGNPETTTGG 

YECO NQIRMKIGVMYGNPETTTGG 

HECO NglRMKIGVMHGNPETTTGG 

AECO NQIRMKIGVMAGNPETTTGG 
WT- 14 RMKIG VMFGNPETT 

WT-Scr iPEQTKGGRNTMNVFGMGIT 
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